⚡ Parallel Computing
An $\widetilde{O}(n^{3/7})$ Round Parallel Algorithm for Matroid Bases
🎞️ Tape Combinatorics · arxiv.org · 6d
Making Julia as Fast as C++
💧 Liquid Types · flow.byu.edu · 5d · Hacker News
Stencil Computations on Cerebras Wafer-Scale Engine
🌊 Streaming Algorithms · arxiv.org · 1d
Light Cone Consistency: Toward a Unified Theory of Consistency in Message-Passing Systems
🎯 Performance Proofs · arxiv.org · 2h · Hacker News
Parallel Lifted Planning via Semi-Naive Datalog Evaluation
🧮 Datalog Engines · arxiv.org · 1d
Stencil Computations on Tenstorrent Wormhole
🕸️ Mesh Networks · arxiv.org · 1d
FATE: Future-State-Aware Scheduling for Heterogeneous LLM Workflows
⚡ Incremental Computation · arxiv.org · 1d
Constraint-Aware Execution Planning for Hybrid Space-Ground Compute Workloads
⚡ Z3 Optimization · arxiv.org · 5d
Regulating Branch Parallelism in LLM Serving
⚡ Cache Coherence · arxiv.org · 1d
Lifting to tensors when compiling scientific computing workloads for AI Engines
🚀 SIMD Parsing · arxiv.org · 6d
On Similarity of Computational Kernels in our Codes and Proxies
🎯 Performance Proofs · arxiv.org · 1d
A Scalable Recipe on SuperMUC-NG Phase 2: Efficient Large-Scale Training of Language Models
🚀 SIMD Text Processing · arxiv.org · 1d
Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems
🦀 Embedded Rust · arxiv.org · 4d · Hacker News
Implementing True MPI Sessions and Evaluating MPI Initialization Scalability
🌊 Streaming Systems · arxiv.org · 6d
On Solving Problems of Substantially Super-linear Complexity in $N^{o(1)}$ Rounds in the MPC Model
🎯 Performance Proofs · arxiv.org · 6d
Symmetry-induced quantum-inspired parallelism of classical dynamic systems
⚛️ Quantum Compilers · arxiv.org · 5d
VDCores: Resource Decoupled Programming and Execution for Asynchronous GPU
🔩 Systems Programming · arxiv.org · 6d · Hacker News
DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
⚡ Z3 Optimization · arxiv.org · 6d
Tackling the Data-Parallel Load Balancing Bottleneck in LLM Serving: Practical Online Routing at Scale
🌊 Streaming Systems · arxiv.org · 4d
Relay Buffer Independent Communication over Pooled HBM for Efficient MoE Inference on Ascend
⚡ Cache Coherence · arxiv.org · 4d